I. INTRODUCTION


A. CONTROL OF DATA


One problem plaguing today's information manager is a
serious lack of control over data which has developed as
computers and their applications have spread throughout
organizations. Recently, there has been a considerable
increase in the attention being paid to this problem.
However, most organizations whose information systems were
developed in the 60's and early to middle 70's still suffer
the ill effects of improperly controlled data. In these
environments, redundant, incomplete, and inaccurate data are
still prevalent. Under such circumstances, the danger
that faulty data will directly contribute to poor
organizational planning and ineffective decision-making is
significantly increased.


While some organizations have undertaken action to
correct their data control problems, others are
overwhelmed by the enormity, complexity, and cost of the
task. In very large organizations, the cost alone can
take on proportions that appear prohibitive.
Unfortunately, it is these large organizations that have
the greatest need for careful control of data. Large
organizations are also more likely to experience adverse
effects which extend beyond those found in smaller
enterprises.


One of these effects is the difficult
position in which some organizational elements find
themselves. As one cog in a large organization, these
elements are forced to use data collected and processed by other
organizational elements over whom they exercise no control.
The danger in this situation is the receipt and
subsequent use of inaccurate data.


Information systems need valid data to be effective. A
mistaken assumption by a data processing element that inaccurate
data are correct can have devastating effects on a parent
organization, especially if information based on the data is
used for strategic planning/decision-making.


When inaccurate data is being transferred
among data processing elements within an organization, the
problem is almost always a systemic one with deep and
widespread roots. Correction on an organization-wide
basis often is neglected because of excessive costs. Users
who find themselves in these situations are frequently left
to their own devices, and they must develop their own means
of dealing with the problem. A user group
experiencing such a situation is the Office of the Deputy
Chief of Staff for Plans (DCSPLANS), U.S. Army Military
Personnel Center (MILPERCEN) in Alexandria, Virginia.


B. DCSPLANS, MILPERCEN


MILPERCEN is responsible for the worldwide
management and professional development of Army officer
and enlisted personnel. As part of MILPERCEN, DCSPLANS has
responsibility for planning and executing current and
projected force alignment, i.e., matching personnel inventory
to force authorization levels.

DCSPLANS is composed of five branches, each of which
handles an assigned portion of the force alignment mission.
Figure 1.1 Force Plans Branch


Each branch uses a series of computerized models to perform
a variety of forecasting functions. Figure 1.1 shows an
example of the models and input files used by one of these
branches. Many of these models are quite complex and draw
their input data from both MILPERCEN and non-MILPERCEN sources.
Some input files are extremely large, feed a number of
models, and historically have been prone to error. None of
these input files are under DCSPLANS control.


Information from DCSPLANS models is used for crucial top
level decision making which will determine the structure and
content of army forces well into the future. As such, the
DCSPLANS output must exhibit a very high degree of validity.
Currently, however, DCSPLANS is unable to verify the
accuracy of much of the input data being used by its models.
Thus, despite the correctness of the models themselves, the
validity of the DCSPLANS product must be considered
doubtful.


DCSPLANS officials are quite concerned about their
present inability to insure that the data used in their
models are accurate. They realize the problem will not be
solved for them soon by the parent organization (MILPERCEN), and
they must devise their own local solution. A variety
of options are available to them. Some are quite poor
(e.g., maintain the status quo and rely on the input data
sources to insure validity); others are more feasible, but
still relatively inefficient (e.g., update/convert
each model to include its own validation process).
A much more effective and efficient alternative is described
in this thesis: the use of an active data dictionary as
a filter to validate input before the data is processed by
the various models.


C. THESIS METHODOLOGY


This thesis will explore the concept of using an active
data dictionary as a local validation tool. It will proceed
from a general review of data validation, through an
examination of data dictionaries, to an
illustration of how an active data dictionary can be
beneficially applied to DCSPLANS operations.


Chapter Two of the thesis cites the essential role of
data validation as an integral part of a data processing
system. Validation criteria and techniques used in the
"data filter" are reviewed, and the general nature of edit
and validation rules is examined.


Chapter Three explores the data dictionary. It presents
some basic definitions and concepts and, in general terms,
addresses how an active data dictionary is used to validate
data.


Chapter Four outlines an approach to the planning and general
design of a data dictionary "filter" system. It
also includes a recommended "checklist" of questions a user
group can ask to define its validation
requirements and system structure.


Chapter Five specifically addresses the DCSPLANS
situation. It cites a proposed goal and some key objectives
of a DCSPLANS validation system, and uses a modified
structure diagram of a "data filter" to illustrate a
recommended approach to DCSPLANS' data validation.


Chapter Six summarizes the results of this thesis. 


II. INPUT VALIDATION 


A. GENERAL DESCRIPTION


Inaccurate data items can easily find their way into
master files and databases, either through direct input or
as a result of improper processing actions by application
programs. Regardless of origin, inaccurate data are poison
to a system. Information created from inaccurate data
also tends to be inaccurate, and decisions based upon such
information are likely to undermine organizational goals in
almost every instance. Data is a valuable resource, and its
accuracy is crucial to organizational success.

Validation is that set of actions which attempts to
preclude the existence of inaccurate data within an
information system. Validation tests can be implemented at
a number of stages within the data processing cycle:
before input, at input, during processing, and after
processing. "Input validation", as
implemented by an active data dictionary system, occurs at
the second stage.

Input validation focuses specifically on data being
entered into a system. Its aim is to detect errors and
thereby insure the initial accuracy of the master file or
database being constructed/updated [Ref. 1: p. 326]. During
input validation, checks are conducted to insure that
the update operation itself is legal, and that input data
does not violate prescribed accuracy constraints. Creation
of a new file or the update of an existing one is a
data processing stage that demands extremely careful
validation, especially in those cases where the input data
is received from sources outside the control of the
processing element. Fortunately, it is at this stage that
the accuracy of data can be checked most effectively
[Ref. 2: p. 289]. One additional caution that should be
mentioned at this point is that data do not become
inaccurate from entry errors alone. Data may be inaccurate
simply because it is old. Previously accurate values may no
longer be correct because available new values have not
superseded older values due to neglect or oversight.
Validation processes also must check for these types of
inaccuracies.


B. VALIDATION TECHNIQUES


1. General


The general category of input validation techniques
used by the "data filter" being proposed examines input data
in the exact form in which it arrives for processing. The
techniques involved detect errors by checking the
"acceptability" of both the data transactions and the data
itself. This checking is accomplished through a series of
programmed instructions/rules, and can be performed very
effectively by an active data dictionary system. Three
basic techniques are included in this category: transaction
validation, format checks, and reasonableness checks. A
well designed validation program will incorporate
all three [Ref. 3].

The transaction validation technique is used to
verify the legitimacy of transactions which update the system.
Format checks and reasonableness checks, on the other hand,
are used to examine the correctness of data items
themselves. In order to facilitate a clear understanding of the
"data filter" design which will be presented later in this
thesis, a more precise description of the three validation
methods is provided below.


2. Transaction Validation


Transaction validation should be the first technique
to be applied. It certifies that "a specific transaction
is one that can be processed by the system and is being
submitted properly." [Ref. 4: p. 218] Its focus is the
verification that the type and purpose of the transaction
are legitimate processing actions, and that the originator
of the transaction has the authority to initiate it.
Transactions determined to be inaccurate are rejected.

Related validation checks which also must be
conducted at this juncture of the processing cycle are
checks for sequential dependencies and/or proper timing.
For example, a Monthly Report transaction may not be able to
take place until the monthly Update transactions are
successfully executed.

The importance of transaction validation as a "first step"
stems from the potential damage which could be inflicted
upon a system by the processing of an invalid transaction.
Even if the invalid transaction is subsequently discovered,
recovery may be extremely difficult. An ounce of
prevention, in this case, is certainly worth a pound of
cure!
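
The transaction checks described above (legitimate transaction type, authorized originator, and sequential dependency) might be sketched as follows. This is only an illustration: the transaction names, the originator roles, and the tables `AUTHORIZED` and `PREREQUISITES` are assumptions made for the example and do not come from any DCSPLANS system.

```python
from dataclasses import dataclass

# Hypothetical transaction types and the originators authorized to submit them.
AUTHORIZED = {
    "MONTHLY_UPDATE": {"clerk", "supervisor"},
    "MONTHLY_REPORT": {"supervisor"},
}

# Hypothetical sequential dependency: a report needs a completed update first.
PREREQUISITES = {"MONTHLY_REPORT": {"MONTHLY_UPDATE"}}


@dataclass
class Transaction:
    kind: str
    originator: str


def validate_transaction(txn, completed):
    """Return a list of rejection reasons; an empty list means accept."""
    if txn.kind not in AUTHORIZED:
        return [f"unknown transaction type: {txn.kind}"]
    reasons = []
    if txn.originator not in AUTHORIZED[txn.kind]:
        reasons.append(f"{txn.originator} may not initiate {txn.kind}")
    missing = PREREQUISITES.get(txn.kind, set()) - completed
    if missing:
        reasons.append(f"prerequisites not yet executed: {sorted(missing)}")
    return reasons


# A report submitted before the monthly update has run is rejected.
print(validate_transaction(Transaction("MONTHLY_REPORT", "supervisor"), set()))
```

Returning reasons rather than a bare true/false is a design choice of the sketch; it lets a rejected transaction be reported back to its originator before any data is touched.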

Once transaction validity is established, the input
data itself is examined through a series of format checks
and reasonableness checks.


3. Format Checks


Format checks compare the actual contents of a field
or record with a set of user-defined rules. A field or
record whose contents fail to conform to the prescribed
format is either rejected outright or transferred to an error
handling routine. Some of the more common format checks
are:

a) Length Checks: used to verify that a field contains a
prescribed minimum, maximum, or fixed amount of
characters.

b) Character Type Checks: used to verify that a field
contains only specifically authorized characters,
i.e., numerics only, alphabetics only, blanks, or
special characters.

c) Character Pattern Checks: used to verify that the
contents of a field match a prescribed pattern of
alphabetics, numerics, and special characters.

d) Date Checks: used to insure that the contents of a
date field are entered in the required, standard
format, i.e., YYMMDD.
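
The four format checks listed above can be sketched in a few lines. The function names and the pattern notation ('A' for an alphabetic, 'N' for a numeric, anything else taken literally) are assumptions introduced for this illustration, not a notation taken from the source.

```python
import re
from datetime import datetime


def length_check(value, minimum=0, maximum=None):
    """Length check: character count lies within the prescribed bounds."""
    return len(value) >= minimum and (maximum is None or len(value) <= maximum)


def character_type_check(value, allowed):
    """Character type check: every character belongs to the authorized set."""
    return all(ch in allowed for ch in value)


def character_pattern_check(value, pattern):
    """Character pattern check: 'A' = alphabetic, 'N' = numeric, else literal."""
    parts = {"A": "[A-Za-z]", "N": "[0-9]"}
    regex = "".join(parts.get(ch, re.escape(ch)) for ch in pattern)
    return re.fullmatch(regex, value) is not None


def date_check(value, fmt="%y%m%d"):
    """Date check: value is six digits forming a real date in YYMMDD form."""
    if len(value) != 6 or not value.isdigit():
        return False
    try:
        datetime.strptime(value, fmt)
        return True
    except ValueError:
        return False
```

A field that fails any of these would be rejected or routed to the error handling routine, as described above.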


4. Reasonableness Checks


Reasonableness checks test data items to insure that
data values fall within the limits of established
constraints. These constraints are separated into three
basic types. Field constraints limit the values of a single
data item. Intrarecord constraints limit values between
fields in the same record. Interrecord constraints limit
values between fields in different records. [Ref. p.
179] Reasonableness checks based upon field constraints are
fairly straightforward in design and implementation.
Intrarecord and interrecord constraint checks, however, deal
with logical accuracy and the interrelationships among data
items. As such, they are much more difficult to design and
manage. Common reasonableness checks are:


a) Field Constraints:
- Range Checks - used to verify that the field value
falls within a specified range, i.e., the value does
not exceed prescribed limits.

- Sequence Checks - used to test a specially created
field to insure records are processed in the proper
sequence. These checks are also used to verify the
presence of all required records.

- Completeness Checks - used to confirm that each
required field in a record is filled with a data
item of some prescribed size.

- Date Checks - used to verify that the contents of a
date field do not violate earliest or latest
acceptable date restrictions.

- Code Checks - used to verify that the contents of a
code field are contained within a listing of valid
and current codes.

b) Intrarecord/Interrecord Constraints:
- Completeness Checks - used to identify those fields
in a record which must be filled based upon the
contents of other fields in that record
(intrarecord) or in other records (interrecord).

- Consistency Checks - used to verify that the values
of a field are valid in relation to the data
values of other fields (either in the same record or
other records).

An example of an intrarecord completeness check is,
"If a given field in a record is filled,
then the Conversion Code field in that record must also be
filled." An interrecord version of a completeness check is,
"If the VRB Multiplier field is filled in one record, then
the VRB Multiplier field in the related record must also be
filled."

An example of an intrarecord consistency check is,
"If the MOS in a record has a given value, then the grade value in
the record must be one of the grades allowed for that MOS." An interrecord
consistency check is "no SSN field value in one record can match
the SSN field value of another record."

It is also possible to have dependencies between files,
e.g., a record with an SSN field value of
"9999936" in file "A" must have the same MOS field value as
a record in file "B" which has an identical SSN field value.
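
The three constraint types can be illustrated with a short sketch. Records are modeled as plain dictionaries, and all field names and values used in the usage lines are hypothetical; only the kinds of check come from the text.

```python
def range_check(value, low, high):
    """Field constraint: value lies within the prescribed limits."""
    return low <= value <= high


def code_check(value, valid_codes):
    """Field constraint: code appears in the listing of valid codes."""
    return value in valid_codes


def intrarecord_completeness(record, trigger, dependent):
    """Intrarecord: if the trigger field is filled, the dependent must be too."""
    return not record.get(trigger) or bool(record.get(dependent))


def interrecord_uniqueness(records, field):
    """Interrecord consistency: no two records share a value in `field`."""
    values = [r[field] for r in records]
    return len(values) == len(set(values))


def cross_file_consistency(file_a, file_b, key, field):
    """Records with the same key in both files must agree on `field`."""
    lookup = {r[key]: r[field] for r in file_b}
    return all(r[field] == lookup[r[key]] for r in file_a if r[key] in lookup)


# Hypothetical records: the same SSN carries different MOS values in two files.
file_a = [{"SSN": "000000001", "MOS": "63H"}]
file_b = [{"SSN": "000000001", "MOS": "11B"}]
print(cross_file_consistency(file_a, file_b, "SSN", "MOS"))
```

The field-level functions need only the value itself, while the intrarecord and interrecord functions need access to whole records or files, which is one reason the latter are harder to design and manage.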


C. EDIT AND VALIDATION RULES


There must be an organized and consistent method for
applying the validation checks cited above to data being
input into an information system. The vehicle for this
application is the edit and validation rule (EVR). EVRs are
explicit statements of constraints on data in a
system. These rules monitor the basic structure and
relationships of data items, and enforce processing
restrictions established by the information system.
[Ref. 6]

Two key issues concerning EVRs must be addressed when
building a data validation system. The first is how to
properly develop consistent rules. Consistency leads to
accurate data, whereas contradictions lead to an
unreliable data system that eventually will crash.
Definition and development of EVRs will be covered in
chapter four as an integral part of the "data
filter" design process.

The second key issue is where to place an EVR module
(i.e., is it better to embed it in the application
program, or is it better to make it a separate validation
program?). The use of an active data dictionary as a "data
filter" argues for the latter approach. The rationale for
such a placement is set forth in the next chapter.
A. BASIC CONCEPTS


Four basic concepts are central to a clear understanding
of how a data dictionary can be used locally to validate
data maintained and provided by other sources. These are:
Data Dictionary, Metadata, Active Data Dictionary, and
Data Extraction.


1. Data Dictionary


A data dictionary is a centralized repository of all
descriptive information about the relevant data in an
enterprise. The data dictionary provides the user a
description of what data exists, what it looks like, and
where it can be found. A data dictionary can be as
simple as a manual catalog system or as complex as an
automated software package which controls a wide range of the
enterprise's data processing operations.


2. Metadata


The real world of an enterprise contains a number of
entities or objects which are represented in the
enterprise's information system as data elements, records,
and files. For example, customers (entity) are represented
as data elements/fields
which are contained in records which, in turn, are
grouped to form a file. The data used to define and
describe these entities are called metadata, i.e., data
about the data. Metadata are stored in the data dictionary,
forming a metadata database or metadatabase. [Ref. 8: p. 9]
Dictionary metadata contain information about each data
object. The metadata answer the following questions:

a) What data is available in the system?

b) What does the data mean?

c) How is the data structured?

d) What constraints and relationships exist?

Typical dictionary metadata include: object name, short name,
synonyms or aliases, source, narrative description,
records/files that use or contain the data object, data
structure/format, integrity constraints (e.g., range),
and relationships/dependencies. [Ref. 9: p. 139] Metadata are
essential ingredients in the validation of data by a data
dictionary system.
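
One way to picture a metadatabase entry is as a small record holding the kinds of metadata just listed. The sketch below is an assumption-laden illustration: the class name, attribute names, and the sample element `YEARS_OF_SERVICE` are all hypothetical and are not taken from any actual dictionary product.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DataElementMetadata:
    """One metadatabase entry; attribute names are illustrative only."""
    name: str
    short_name: str
    description: str                                  # what the data means
    source: str
    aliases: list = field(default_factory=list)
    used_in: list = field(default_factory=list)       # records/files holding it
    char_type: str = "alphanumeric"                   # structure/format
    min_length: int = 0
    max_length: int = 0
    value_range: Optional[tuple] = None               # integrity constraint
    depends_on: list = field(default_factory=list)    # relationships


metadatabase = {}


def register(entry):
    """Store an entry under its name and under every alias."""
    for key in [entry.name, *entry.aliases]:
        metadatabase[key] = entry


register(DataElementMetadata(
    name="YEARS_OF_SERVICE", short_name="YOS",
    description="Completed years of active service",
    source="hypothetical source file",
    aliases=["YRS_SVC"], used_in=["PERSONNEL_RECORD"],
    char_type="numeric", min_length=1, max_length=2, value_range=(0, 40),
))
```

Looking an element up by name or alias then answers the four questions above: whether the data exists, what it means, how it is structured, and which constraints and relationships apply.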


3. Active Data Dictionary


There are two basic modes in which a data dictionary
can function: passive or active. A passive data dictionary
merely registers the metadata and provides the user a
facility for interactive query and/or reporting. It
does not require that data processing operations consult
it for metadata, and no direct link is maintained between
the passive data dictionary and other system components.
(See Figure 3.1) In fact, programs and
processes may obtain their metadata entirely from other
sources.

An active data dictionary, on the other hand,
exercises a great deal of control over
metadata usage within an information system. A
dictionary is said to be active with respect to an
information system, if, and only if, that system is
dependent upon the data dictionary for its metadata.
(See Figure 3.2)


Figure 3.1 Passive Dictionary


A dictionary is active to a lesser degree when only
some of the system's programs and processes are dependent
upon it for metadata. The more programs or processes that
rely on the dictionary, the more active it is.

The value of an active data dictionary stems
from the establishment of mandatory interfaces between it
and various system processes. When the data dictionary is
used as a "data filter", these mandatory interfaces
insure that data conform to pre-defined rules and
standards.
Figure 3.2 Active Dictionary


4. Data Extraction


Data extraction is a technique whereby a subset of
data from a very large file system or database is
transferred to a much smaller "extracted" file or database.
The extraction process can be either quite simple or very
complex. A complex data extraction process is designed to
collect, format, and integrate data from a number of
files/databases into a single data source whose contents are
specifically tailored to the needs of a particular group
of users. Such a system involves extensive data
description, subsetting, aggregation, and presentation
operations. [Ref. 11: p. 245] This thesis addresses data
extraction from a much simpler perspective, i.e., as a means
of limiting the size of the data to be validated by the data
dictionary. In most cases, user applications do not need
all data contained in a large data source. Thus, the
extraction of only pertinent data (a much smaller subset)
usually serves to increase the speed of application programs
accessing the data. Such data extraction operations can
be used to greatly enhance the efficiency of the proposed
"data filter" when large source files are involved. A
diagram of a simple data extraction design which can be used
in conjunction with a data dictionary "filter" is shown in
Figure 3.3.
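
The simple form of extraction described here, taking only the pertinent records and fields so that less data has to be validated, might look like the following sketch. The function `extract`, its parameters, and the sample records are assumptions made for illustration.

```python
def extract(source_records, needed_fields, keep=lambda record: True):
    """Yield only the pertinent records, trimmed to the needed fields."""
    for record in source_records:
        if keep(record):
            yield {name: record[name] for name in needed_fields}


# Hypothetical source file: each record carries more than the model needs.
source_file = [
    {"id": "001", "grade": "E5", "years": 6, "remarks": "..."},
    {"id": "002", "grade": "O3", "years": 8, "remarks": "..."},
    {"id": "003", "grade": "E6", "years": 11, "remarks": "..."},
]

# Keep enlisted records only, and only the two fields the model uses.
subset = list(extract(source_file, ["id", "grade"],
                      keep=lambda r: r["grade"].startswith("E")))
print(subset)
```

Because the extraction is written as a generator, a very large source file can be read once, record by record, without holding all of it in memory.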

For the remainder of this thesis, the term
"data filter" will refer to the active data dictionary
validation system being proposed.


B. CONFIGURATION 


1. Metadata Generation


The key to configuring the data filter is
providing the active data dictionary the capability to
generate the metadata needed by a system's edit and
validation programs. This metadata generation is triggered
by the validation software through the issuance of
calls with appropriate parameters. The data filter must be
configured so that edit and validation, with its mandatory
call for metadata generation, is automatically activated
during all data input operations. The resulting metadata
generation produces data descriptions based upon the
characteristics stored in the data dictionary metadatabase.
These data descriptions are transformed into specific edit
and validation rules for use by validation programs
[Ref. 12].
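
The flow just described, in which the validation software calls the dictionary and receives edit and validation rules generated from stored characteristics, can be sketched as below. This is a simplified assumption about how such an interface could work, not the design of any particular dictionary package; `METADATABASE`, `generate_rules`, and `edit_and_validate` are names invented for the example.

```python
# Hypothetical metadatabase contents; a real dictionary would hold far more.
METADATABASE = {
    "GRADE": {"type": "alphanumeric", "length": (2, 2), "codes": {"E5", "E6"}},
    "YEARS": {"type": "numeric", "length": (1, 2), "range": (0, 40)},
}


def generate_rules(element):
    """Turn one element's stored characteristics into executable rules."""
    meta = METADATABASE[element]
    low, high = meta["length"]
    rules = [("length", lambda v: low <= len(v) <= high)]
    if meta["type"] == "numeric":
        rules.append(("numeric", str.isdigit))
    if "range" in meta:
        lo, hi = meta["range"]
        rules.append(("range", lambda v: v.isdigit() and lo <= int(v) <= hi))
    if "codes" in meta:
        rules.append(("code", lambda v: v in meta["codes"]))
    return rules


def edit_and_validate(record):
    """Call the dictionary for each field and report every failed rule."""
    return [(name, label)
            for name, value in record.items()
            for label, rule in generate_rules(name)
            if not rule(value)]


print(edit_and_validate({"GRADE": "E9", "YEARS": "12"}))
```

Since the rules are generated at validation time, changing an entry in the metadatabase changes the validation applied without any change to the program that calls it.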


2. Edit and Validation


Edit and validation programs are the
application programs which enter the data into the system.
They cannot be executed without data descriptions in
the form of EVRs obtained through the data dictionary.
The tailoring of the programs to specific applications is
accomplished through the EVRs passed to each edit and
validation program.
Figure 3.5 The Data Filter System

time saved becomes quite significant.

Most data editing and validation systems provide
the basic capability to validate and edit data, and to
correct and report erroneous data. There are, however,

one source of metadata for all edit and validation

consistency is promoted. In essence, a much greater degree
of control over metadata is realized, and, as a result,
more consistent validation of data is achieved.

Second, the dictionary affords the user a very
flexible and adjustable validation mechanism.
Changes and revisions to validation criteria do not
require changes to application programs or edit and
validation programs. Instead, changes are easily

An interdependent extraction/validation process would not
allow this alternative.


IV. PLANNING AND GENERAL DESIGN


A. KEY DEVELOPMENT PHASES


A software product's ability to do what it is supposed
to do efficiently is largely governed by the quality of the
detailed design and coding that creates it. In turn,
successful detailed design and coding are directly tied to
sound initial planning and design activities.
Thus, the planning and preliminary design steps taken in
developing a local data filter are crucial, and must
be comprehensively and carefully accomplished.

Planning and initial design of a data filter is a three
phased process. Phase one describes the system's
environment and general characteristics. Phase two develops
data definitions and validation criteria. Phase three
produces an initial logical design of the system. A
description of each of these phases is presented below,
along with a checklist of relevant questions which serves
as a guide for proceeding through the phase.

These phases provide a framework within which
users/developers can methodically develop the data filter.
The framework assists them in:

- Creating a comprehensive picture of the
environment in which the filter will function.

- Identifying and defining the data to be validated,
and determining the nature and scope of validation
required.

- Designing well-defined, functionally structured
validation and EVR modules.


B. PHASE ONE - SYSTEM ENVIRONMENT/GENERAL CHARACTERISTICS


1. Description


This phase identifies all hardware and firmware
being used (or projected for use) in the overall information
system, and describes its environment (e.g., distributed vs.
centralized system, file system vs. database system, etc.).
It notes validation capabilities already present in the
system, and also identifies commercial validation
capabilities which are compatible with existing hardware and
firmware.

Phase one also uncovers the general nature of the
input data to be validated. It identifies the broad
categories of input data, examines data stability and
consistency, and looks at who exercises control over the
entry of data into the system. This phase outlines data
entry methods and notes the various processing stages at
which data validation may occur (prior to input, at input,
etc.). An overview of system output is also formulated.
The level of accuracy required for the output is
established, and the degree to which output validity is
dependent upon valid input is determined.
distributed?


C. PHASE TWO - DATA DEFINITION/VALIDATION CRITERIA


1. Description


This phase identifies and defines all system
entities. For the purpose of the data filter, data entities
include all data elements entered into the system and the
records and files which contain them. The application programs
which use/process these entities are also established.

Phase two also sets forth all validation checks
required. Data element characteristics such as format,
range, type, size, sequence, etc. are recorded, and all
entity relationships are carefully delineated. The
information developed during this phase forms the data
dictionary metadatabase, and is used to construct the
system's EVR and validation program modules.


Answers to the questions listed below will enable
the user/developer to identify, describe, and determine the
interrelationships of all system entities. The user will also be
able to establish validation criteria for data elements and
cross-reference them to the applications which require
such validation.


a) What data elements does the system contain?
b) What record(s) contain these data elements?
c) What file(s) contain these records?
d) For each application program:
- Which files feed it data elements?
- Which data elements does it use/process?
- Which data elements must be validated (i.e., does
the validity of the application's output depend upon
this input data element being valid)?
- Is a specific sequence of data entry required?
- What prior updates/transactions must occur, if
any?
e) For each data element:
- What is its name? Any synonyms or aliases?
- What is its short name, programming name?
- What is its source?
- What is its character type (alpha, numeric, etc.)?
- What minimum and maximum number of characters are
allowed?
- What numeric value range applies?
- What character pattern, if any, is used?
- Is there a minimum/maximum range of allowable change
from one update to the next?
- What dependency relationships exist with other
data elements, in the same record/file or in other
records/files (e.g., if "A" is changed, then "B"
must be changed)?
- Is a particular update sequence required?
- Do date fields have any earliest or latest date
limits?
- Do date fields require a special format (e.g.,
YYMMDD)?
- What value relationships exist with other data
elements (e.g., the value of "A" must always be twice
the value of "B")?
- Is the data element a code or a value that must
appear in a table or listing of valid values?
D. PHASE THREE - INITIAL LOGICAL DESIGN


1. Description


Phase three produces a model of the logical
structure of the data filter system before it is
"built" (during coding and testing). This model forms the
basis for all further design steps. The
preliminary logical design is the key step in the data
filter design process. The data filter structure developed
during this phase is based upon the general filter design
cited in chapter three and the system environment and
data/validation information gathered during phases one and
two.

Phase three gives the user a description of the data
filter system goal and objectives, and presents the major
system functions. These major functions are then decomposed
into sub-functions until a series of simple, independent
modules have been identified. This overall system
architecture is depicted in a hierarchical structure diagram
(See Figure 4.1) accompanied by brief descriptions of
the modules.


2. Checklist 


Answers to the following questions will enable the
user/developer to produce the initial logical design:

a) What is the goal of the system? (State the general
long-term desired effect).

b) What are the system's objectives? (State the key
results required to achieve the stated system goal).

c) What are the system's major functions? (State the
general processing activities required to meet system
objectives). For example, a bank account
system may have four major system functions: (1)
account administration (open accts., close
accts., etc.), (2) processing deposits, (3) processing
withdrawals, and (4) maintaining an account transaction
database.

d) What modules (sub-functions) comprise each of the
major functions? (Limit to no more than 3-5
modules per function, and repeat the process at each
level until no further module decomposition is
necessary, i.e., simple, independent modules have been
created).

e) What does each system module do? (Give a precise,
concise description of approximately two sentences).


Once the above phases have been completed and
carefully documented, the data filter structure has been
tailored to the user's specific environment and validation
needs. Subsequent development involving detailed design
(data flows, data stores, interfaces, etc.), programming,
testing, etc. can follow using one of a number of development
methodologies which currently exist.


V. THE DCSPLANS "DATA FILTER" SYSTEM 

This chapter specifically addresses the DCSPLANS "Data
Filter" system. It provides a statement of the system's
overall goal and its key objectives. It also expands the
general data filter design provided in chapter three into a
more detailed hierarchical design structure tailored to the
DCSPLANS situation.


A. DCSPLANS SYSTEM GOAL AND OBJECTIVES


All aspects of DCSPLANS' unique operational characteristics
must be considered when formulating the system's goal and
its key objectives. These critical aspects are uncovered
during Phases I and II of the preliminary development
activity (presented in the previous chapter), and are used
to create the Phase III deliverables illustrated in this
chapter (System Goal/Objectives and Structure Diagram with
module descriptions). The DCSPLANS characteristics
having the greatest impact on the general system design are
presented below.

The most important fact is that DCSPLANS personnel have
no assurance of the accuracy of the input data they are
processing. The data come from very large source files
prepared and maintained by elements outside their span of
control. At the present time, DCSPLANS does not possess the
capability to test the correctness of its input data. They
are, however, extremely worried about the adverse impact of
such data on the validity of model outputs.

[...] files provide crucial data to DCSPLANS' force
[...] models. Each of the files feeds a varying number of
models, and supplies a unique set of [...] depending on
the particular model involved. [...] data elements
contained in the source files and the data elements
required by the models remain the same, creating
relatively good system stability in this regard. There
are, however, occasional changes made in the data elements
provided or required. A DCSPLANS validation tool must
provide the flexibility to incorporate such changes.

In many cases, models using the same data elements from
the same source file require different degrees of validation
(e.g., the validity of input data taken from the Enlisted
Master File may be crucial to the validity of output
produced by the Personnel Readiness Indicator Model, but
inconsequential to the validity of output produced by the
Personnel Policy Projection Model (P3M)). The DCSPLANS
validation tool must be able to differentiate between the
validation required for Enlisted Master File data to be used
by the Personnel Readiness Indicator Model as opposed to the
P3M, and it must apply edit and validation rules
accordingly.
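
One way to picture this requirement is a lookup keyed on the combination of source file and model. The sketch below is illustrative only and written in Python for readability; the `Degree` levels, the table contents, and the `pay_grade` element are assumptions, not items taken from the DCSPLANS files or models.

```python
from enum import Enum


class Degree(Enum):
    NONE = 0         # element is inconsequential to this model
    FORMAT_ONLY = 1  # routine format check only
    FULL = 2         # format plus reasonableness checks


# (source file, model) -> {data element: degree}; every entry is illustrative.
VALIDATION_DEGREES = {
    ("Enlisted Master File", "Personnel Readiness Indicator Model"): {
        "pay_grade": Degree.FULL,
    },
    ("Enlisted Master File", "Personnel Policy Projection Model"): {
        "pay_grade": Degree.NONE,
    },
}


def degree_for(source_file, model, element):
    """Look up how strictly an element must be validated for a given model."""
    rules = VALIDATION_DEGREES.get((source_file, model), {})
    return rules.get(element, Degree.NONE)
```

The same element from the same file then receives a different degree of validation depending only on the model named in the lookup.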

Generally, DCSPLANS' models are run on a standard
schedule which coincides with required briefings/reporting
and which also facilitates use of one model's output as
input for another model. There are, however, occasions when
a model's output is required on very short notice. Under
such circumstances, the time normally devoted to data
validation may not be available, and the DCSPLANS models
would have to be run in the quickest possible time without
regard to data integrity. While such a procedure seems
unwise, it may occur, and the DCSPLANS validation tool must
provide for such a contingency by allowing [...] as
required. In this regard, the [...] must not be a mandatory
part of any integral data extraction/modeling process.
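
The contingency described above amounts to an optional filter stage between extraction and modeling. A minimal Python sketch follows, assuming a hypothetical `prepare_input` function and a caller-supplied `validate` predicate; neither name comes from the DCSPLANS design.

```python
def prepare_input(records, validate, bypass=False):
    """Return (records passed to the model, records rejected).

    With bypass=True the filter is skipped and every record is passed
    through unchecked, which trades data integrity for speed.
    """
    if bypass:
        return list(records), []
    passed, rejected = [], []
    for record in records:
        (passed if validate(record) else rejected).append(record)
    return passed, rejected
```

Keeping the bypass as an explicit argument makes the decision visible at the point where the model run is requested, rather than burying it inside the validation logic.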
Most of DCSPLANS' processing activities are done in a
batch mode. The extraction of pertinent data from large
input files is also a batch process (e.g., [...] is a
program developed and used by DCSPLANS to extract pertinent
data from the Enlisted Master File). However, capabilities
to manipulate data residing in the metadatabase and to
query the metadatabase on-line are crucial to effective,
user-friendly operation of the data filter system. All
other data filter processes (e.g., EVR formulation) will be
done in batch mode to insure run-time efficiency.

Based upon an examination of the overall DCSPLANS
environment, and keying on those points just mentioned, the
goal of the DCSPLANS data filter system is to validate all
externally provided input data used by DCSPLANS' force
alignment models in consonance with established DCSPLANS
quality control standards.
Key objectives of the DCSPLANS data filter system are:

1. It must be compatible with the existing DCSPLANS
computer systems configuration.

2. It must allow flexible and easy additions and updates
to the metadatabase.

3. Its interface with the data extraction and modeling
processes must be optional (at the discretion of the
[...]); otherwise it will be [...] automatic [...].

4. It must provide for [...] adjustment of edit and
validation rules to suit the particular source file and
model being processed.

5. It must provide an interactive query facility for
accessing the metadatabase.

6. It must provide an error/status report generation
facility.

[...] conservative approach [...]


B. "DATA FILTER" STRUCTURE 


This section uses a structure diagram (in modified
format) to set forth the proposed structure of the DCSPLANS
"data filter" system software. The structure is derived
from a functional decomposition process in which major
system functions are split successively into sets of
sub-functions. The proposed DCSPLANS system will be
decomposed to three levels. This decomposition demonstrates
the hierarchical control structure and relationship of
modules which comprise the overall "data filter" system.
It does not represent any particular process sequence or
order of decision-making. [Ref. ...]

The structure diagram is normally presented in the
graphical format shown in Figure [...]. To prevent the
crowding effect that will occur from a three-level
decomposition, the major system functions (level 1) and
subordinate modules (levels 2 and 3) are represented here in
paragraph/sub-paragraph format ([...]). Structures
depicted in this manner are easily transferred to a graphic
representation of the overall system [...].


1. Structure Diagram


The proposed data filter system contains five major
functions (Control Data Filter, Maintain Metadatabase,
Produce EVR, Validate Input Data, Generate Reports). The
system's hierarchical structure is illustrated below,
followed by descriptions of each function, sub-function,
and lower level module.
          2.2.1  Read Add Data
          2.2.2  Check Uniqueness
          2.2.3  Check Format
          2.2.4  Accept Metadata
     2.3  Delete Metadata
          2.3.1  Read Delete Request
          2.3.2  Locate Metadata
          2.3.3  Remove Metadata
     2.4  Change Metadata
          2.4.1  Read Change Request
          2.4.2  Locate Metadata
          2.4.3  Update Metadata

3.0  PRODUCE EVR
     3.1  Control
     3.2  Retrieve Metadata
          3.2.1  Read Source File/Model Codes
          3.2.2  Open Metadata File(s)
          3.2.3  Extract Pertinent Metadata Values
     3.3  Formulate EVR
          3.3.1  Load Variables
          3.3.2  Set Switches

4.0  VALIDATE INPUT DATA
     4.1  Control
     4.2  Select EVR
          4.2.1  Determine Input Record Types
          4.2.2  Extract Applicable EVR
     4.3  Apply EVR
          4.3.1  Read Input Data
          4.3.2  Read EVR
          4.3.3  Check Parameters
     4.4  Provide Processed Input Data
          4.4.1  Read Error Code
          4.4.2  Transfer Erroneous Data with Error Code
          4.4.3  Transfer Valid Data
     4.5  Maintain Statistics
          4.5.1  Maintain Transaction Count
          4.5.2  Maintain Error Count
          4.5.3  Sort Error Types

5.0  GENERATE REPORTS
     5.1  Control
     5.2  Retrieve Report/Response Data
          5.2.1  Determine Report/Response Type
          5.2.2  Read Applicable Data
     5.3  Perform Calculations
     5.4  Provide Report/Response
          5.4.1  Determine Format
          5.4.2  Format Data
          5.4.3  Transfer to Output Device


2. Narrative Descriptions 


Following are succinct explanations of the key [...] of
each structure diagram function, sub-function, and module.
Each lower level description serves to refine the
description of its superior level [...].


1.0 CONTROL DATA FILTER: This function controls access
to the data filter system and verifies transaction
validity. It also provides screens for [...] major system
functions, and transfers control to these processes.

1.1 DETERMINE ACCESS AND TRANSACTION VALIDITY: This
sub-function insures that the user is authorized access to
the system and the desired transaction, and that the
transaction itself is valid (e.g., an attempt to validate
the Enlisted Management File for use in the Officer
Promotion Model would be rejected).
1.1.1 READ ACCESS AND TRANSACTION CODES: This module
reads in the user's access code and the transaction codes
indicating the desired process and the source input
file/model(s) involved.

1.1.2 EVALUATE CODES: This module [...] the codes against
authorized access and transaction codes.

1.1.3 IMPLEMENT VALIDITY DECISION: This module will
either reject the transaction or pass an indication of a
valid transaction to module 1.2.1. This module also sets
restrictions within authorized processes (e.g., a user may
be allowed to add metadata, but not to change or delete
existing metadata).
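
The evaluation performed by modules 1.1.1 through 1.1.3 can be sketched as two table lookups. The Python below is a rough illustration under stated assumptions: the access codes, process names, and file/model pairings are all hypothetical and stand in for whatever codes DCSPLANS would actually define.

```python
# All codes below are hypothetical.
AUTHORIZED_PROCESSES = {
    "USER01": {"ADD"},                                  # may add only
    "USER02": {"ADD", "CHANGE", "DELETE", "VALIDATE"},
}
VALID_FILE_MODEL_PAIRS = {
    ("ENLISTED", "READINESS"),
    ("OFFICER", "PROMOTION"),
}


def evaluate_codes(access_code, process, source_file=None, model=None):
    """Return True for a valid transaction, False if it must be rejected."""
    # The user must be authorized for the requested process.
    if process not in AUTHORIZED_PROCESSES.get(access_code, set()):
        return False
    # A validation run must also name a sensible file/model pairing.
    if process == "VALIDATE":
        return (source_file, model) in VALID_FILE_MODEL_PAIRS
    return True
```

Under these assumed tables, a request to validate an enlisted file for an officer promotion model is rejected even when the user is otherwise authorized, which mirrors the example given for sub-function 1.1.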

1.2 PROVIDE MENU/SCREEN: This sub-function provides
the user with the appropriate screen for continued use
of the system.

1.2.1 READ VALIDITY DECISION: This module reads the
validity indicator provided by module 1.1.3.

1.2.2 DISPLAY APPROPRIATE SCREEN: This module causes
either a menu or screen, as appropriate, to appear on
the monitor.

1.3 TRANSFER CONTROL: This sub-function transfers control
to an appropriate system function [...].

1.3.1 READ SCREEN INPUT: This module reads user
responses to terminal prompts.

1.3.2 DETERMINE PROCESS: This module [...] user input in
terms of the desired system function (e.g., update
metadata, [...]).

1.3.3 PASS PROGRAM CONTROL: This module passes control
to the appropriate system function.
2.0 MAINTAIN METADATABASE: This function adds new
metadatabase entries, deletes metadatabase contents,
and makes changes to the existing metadatabase.

2.1 CONTROL: This sub-function displays the
metadatabase menu, and governs the activation and
sequence of add, change, and delete processes.

2.1.1 PROVIDE METADATABASE MENU: This module provides
a menu giving the user options for adding, deleting, or
changing metadata.

2.1.2 TRANSFER CONTROL: This module passes control to
either modules 2.2, 2.3, or 2.4, depending on user
request and access authorization.

2.2 ADD METADATA: This sub-function reads metadata
input, checks it for duplication and proper entry
format, and either rejects the input or stores it in
the metadatabase.

2.2.1 READ ADD DATA: This module reads data which the
user desires to enter into the metadatabase.

2.2.2 CHECK UNIQUENESS: This module checks the
metadatabase to insure data to be added does not
already reside there.

2.2.3 CHECK FORMAT: This module checks data to be
added to insure it meets standard metadata formats.

2.2.4 ACCEPT METADATA: This module evaluates results
of [...] processing, and either rejects data to be added
or stores it in the metadatabase.
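
Sub-function 2.2 reduces to a uniqueness check, a format check, and an accept-or-reject decision. The Python sketch below illustrates that sequence; the in-memory dictionary standing in for the metadatabase, the `add_metadata` name, and the entry-name format are assumptions made for illustration.

```python
import re

# Assumed entry-name format: a capital letter followed by up to nine
# capitals, digits, or underscores. The real standard is not given here.
NAME_FORMAT = re.compile(r"[A-Z][A-Z0-9_]{0,9}")


def add_metadata(metadatabase, name, definition):
    """Check uniqueness, then format, then store or reject the entry."""
    if name in metadatabase:                 # 2.2.2 Check Uniqueness
        return "rejected: duplicate"
    if not NAME_FORMAT.fullmatch(name):      # 2.2.3 Check Format
        return "rejected: format"
    metadatabase[name] = definition          # 2.2.4 Accept Metadata
    return "stored"
```

A rejected entry leaves the metadatabase untouched, so the caller can redisplay the input screen with the returned reason.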
2.3 DELETE METADATA: This sub-function reads a metadata
deletion request, locates the data in the metadatabase,
and removes it.

2.3.1 READ DELETE REQUEST: This module reads the
user's request to delete data.

2.3.2 LOCATE METADATA: This module locates [...]
metadata in the metadatabase.

2.3.3 REMOVE METADATA: This module removes metadata
from the metadatabase after a re-verification of the
user's desire to delete the data.

2.4 CHANGE METADATA: This sub-function reads a
metadata change request, locates the data to be
changed, and updates the data after [...] the new
metadata meets the prescribed entry format.

2.4.1 READ CHANGE REQUEST: This module reads the
user's request to update existing metadata.

2.4.2 LOCATE METADATA: This module locates the
metadata to be changed.

2.4.3 UPDATE METADATA: This module replaces existing
metadata with new metadata.
3.0 PRODUCE EVR: This function produces the edit and
validation rules for use by sub-function [...]. Metadata
values are extracted from the metadatabase and are
transformed into bounded conditional statements against
which input data will be [...].

3.1 CONTROL: This sub-function governs the activation
and sequence of processes involved with the production of
the edit and validation rules.

3.2 ACCEPT PROCESSING CODES: This sub-function reads
the source file and model codes entered by the user,
opens appropriate metadata files, extracts pertinent
metadata values, and stores them in a "variables" file.

3.2.1 READ SOURCE FILE/MODEL CODES: This module reads
the source file and model identification codes entered
earlier by the user.

3.2.2 OPEN METADATA FILE(S): This module locates
and opens all metadata files containing [...] pertinent
to source file and model(s) [...] by module 3.2.1.

3.2.3 EXTRACT PERTINENT METADATA VALUES: This module
extracts pertinent metadata values from opened
metadatabase files and stores the values in a "variables"
file.
3.3 FORMULATE EVR: This sub-function [...] metadata
values stored in the "variables" file into a file of
pre-established conditional statements, thereby setting
switches either on or off and setting upper and lower
boundaries of acceptable input data values. (Settings and
boundaries will therefore vary according to the
combination of source file and model codes presented by
the user.)

3.3.1 LOAD VARIABLES: This module reads the
"variables" file into a file of pre-set conditional
statements.

3.3.2 SET SWITCHES: This module, depending on variable
values, sets switches either on or off and establishes
upper and lower boundaries, as required.
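
The idea of loading variables into pre-set conditional statements can be illustrated with a short Python sketch. It is not the proposed DBASE II implementation: the metadata rows, the model names `MODEL_A` and `MODEL_B`, the `age` element, and its bounds are all hypothetical, and the "conditional statement" is reduced to a single range test.

```python
# Hypothetical metadatabase rows; "check" plays the role of the on/off switch.
METADATA = [
    {"source": "EMF", "model": "MODEL_A", "element": "age",
     "check": True, "low": 17, "high": 65},
    {"source": "EMF", "model": "MODEL_B", "element": "age",
     "check": False, "low": None, "high": None},
]


def extract_variables(source, model):
    """Collect the metadata values pertinent to one source file/model pair."""
    return {row["element"]: row for row in METADATA
            if row["source"] == source and row["model"] == model}


def formulate_evr(variables):
    """Bind the variables into range tests; switched-off elements get no rule."""
    rules = {}
    for element, v in variables.items():
        if not v["check"]:          # switch is off for this model
            continue
        # Default arguments freeze the bounds for this particular rule.
        rules[element] = lambda value, low=v["low"], high=v["high"]: low <= value <= high
    return rules
```

Because the bounds and switches come only from the metadata rows, the same code yields different rules for each source file and model combination.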

4.0 VALIDATE INPUT DATA: This function actually
performs the validation by selecting specific EVR,
applying them to input data, and providing the
processed data to either a "validated data" file or an
error file. This function also maintains statistics on
the number of data items processed and the number and
category of errors found.

4.1 CONTROL: This sub-function governs the activation
and sequence of processes involved in the actual
validation of input data.

4.2 SELECT EVR: This sub-function identifies the type
of record(s) being validated from the source file, and
activates only those EVR which apply. (This selection
spares the validation program from comparing an input
record against [...] the EVR, thereby enhancing run-time
efficiency of the overall process.)

4.2.1 DETERMINE INPUT RECORD TYPES: This module
identifies the sub-set of records that are being
validated from the source input [...].

4.2.2 EXTRACT APPLICABLE EVR: This module extracts
only those EVR applicable to the record types being
validated.
4.3 APPLY EVR: This sub-function reads input data
and its associated EVR, and compares [...].

4.3.1 READ INPUT DATA: This module reads the
input data to be validated.

4.3.2 READ EVR: This module reads EVR from module
[...].

4.3.3 CHECK PARAMETERS: This module compares input
data to EVR parameters, assigning an appropriate error
code (including [...]).
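
The parameter check in module 4.3.3 might look like the following Python sketch. The numeric error codes, including a code meaning "no error", are assumptions made for illustration, as is the representation of each rule as a (low, high) pair.

```python
# Assumed error codes; the actual code scheme is not specified here.
NO_ERROR, MISSING_VALUE, OUT_OF_RANGE = 0, 1, 2


def check_parameters(record, evr):
    """Compare one input record with its EVR and return an error code.

    `evr` maps an element name to an inclusive (low, high) boundary pair.
    The first failing element determines the code.
    """
    for element, (low, high) in evr.items():
        value = record.get(element)
        if value is None:
            return MISSING_VALUE
        if not low <= value <= high:
            return OUT_OF_RANGE
    return NO_ERROR
```

Returning a code for every record, valid or not, lets the downstream transfer step treat all processed data uniformly.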

4.4 PROVIDE PROCESSED INPUT DATA: This sub-function
reads the processed data and its error code, and
transfers the data accordingly.

4.4.1 READ ERROR CODE: This module reads data and its
associated error code from module 4.3.3.

4.4.2 TRANSFER ERRONEOUS DATA WITH ERROR CODE: This
module transfers erroneous data with its associated error
code to an error file.

4.4.3 TRANSFER VALID DATA: This module transfers
valid input data to a "validated data" file.
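
The routing performed by sub-function 4.4 can be sketched in Python as follows. The comma-separated file layout, the `transfer` name, and the convention that error code 0 means "valid" are assumptions for illustration; in-memory text buffers stand in for the two output files.

```python
import csv
import io


def transfer(processed, valid_file, error_file):
    """Write each (record, error_code) pair to the appropriate file.

    Valid records (code 0, by assumption) go to the validated-data file;
    erroneous records are written with their error code appended.
    """
    valid_writer = csv.writer(valid_file)
    error_writer = csv.writer(error_file)
    for record, error_code in processed:
        if error_code == 0:
            valid_writer.writerow(record)
        else:
            error_writer.writerow([*record, error_code])


# Usage with in-memory buffers and made-up records.
valid_out, error_out = io.StringIO(), io.StringIO()
transfer([(["1001", "E5"], 0), (["1002", "E99"], 2)], valid_out, error_out)
```

Keeping the error code with the rejected record means the error file alone is enough to explain why each record was held back.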

4.5 MAINTAIN STATISTICS: This sub-function maintains a
running count of the number of transactions processed
and the number and type of errors found.

4.5.1 MAINTAIN TRANSACTION COUNT: This module
maintains a running count of the number of transactions
processed in a validation activity.

4.5.2 MAINTAIN ERROR COUNT: This module maintains the
number of errors found in a validation activity.

4.5.3 SORT ERROR TYPES: This module sorts the validation
activity's error codes by type of error.
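
The three statistics modules can be combined in a few lines. This Python sketch assumes, for illustration, that each processed transaction yields one integer error code and that code 0 means no error.

```python
from collections import Counter


def tally(error_codes):
    """Return (transaction count, error count, errors sorted by type)."""
    transactions = 0
    errors = Counter()
    for code in error_codes:
        transactions += 1          # 4.5.1 Maintain Transaction Count
        if code != 0:
            errors[code] += 1      # 4.5.2 Maintain Error Count
    # 4.5.3 Sort Error Types: (error code, occurrences) in code order
    return transactions, sum(errors.values()), sorted(errors.items())
```

The sorted list of (code, occurrences) pairs is a convenient input for the report generation function.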
5.0 GENERATE REPORTS: This function accepts requests
for both printed reports and [...] responses, determines
and retrieves the appropriate report/response data,
performs calculations and formatting as required, and
issues the requested report/response.

5.1 CONTROL: This sub-function governs the activation
and sequence of processes involved in the production of
printed reports and interactive response to terminal
queries.

5.2 RETRIEVE REPORT/RESPONSE DATA: This sub-function
determines the type of report/response desired and
reads required data from appropriate files.

5.2.1 DETERMINE REPORT/RESPONSE TYPE: This module
[...] the user request for information in terms of
report/response content.

5.2.2 READ APPLICABLE DATA: This module locates, [...]
stores the data needed for the requested
report/response.

5.3 PERFORM CALCULATIONS: This sub-function determines
whether calculations are required to produce desired
[...]. If so, it reads the appropriate data and performs
the required operations, producing "new" report/response
data.

5.4 PROVIDE REPORT/RESPONSE: This sub-function
determines the [...] report/response format, formats the
data accordingly, and transfers the formatted data to the
appropriate output device.

5.4.1 DETERMINE FORMAT: This module [...] required for
the desired response in accordance with pre-established
format parameters.

5.4.2 FORMAT DATA: This module arranges data in proper
format.

5.4.3 TRANSFER TO OUTPUT DEVICE: This module transfers
the formatted data to the appropriate output device.
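
As a rough illustration of sub-functions 5.3 and 5.4, the Python sketch below derives one calculated figure and arranges counts in a fixed-width layout. The report title, the column width, and the choice of an error-rate calculation are assumptions, not part of the proposed design.

```python
def error_rate(transactions, errors):
    """Percentage of transactions in error (0.0 when nothing was processed)."""
    return 100.0 * errors / transactions if transactions else 0.0


def format_report(title, counts, width=30):
    """Arrange (label, count) pairs in two fixed-width columns."""
    lines = [title, "-" * width]
    for label, count in counts:
        # Label is left-justified; the count occupies the last six columns.
        lines.append(f"{label:<{width - 6}}{count:>6}")
    return "\n".join(lines)


report = format_report("VALIDATION SUMMARY",
                       [("Transactions", 5), ("Errors", 3)])
```

The resulting string can be sent unchanged to a printer or a terminal, which corresponds to the final transfer step.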


C. "DATA FILTER" IMPLEMENTATION


Two key advantages inherent in the proposed local data
validation system concept are low development cost and
speedy implementation. In this light, initial DCSPLANS
development efforts must focus on the [...] of a
prototype system that takes maximum advantage of existing
resources. Specifically, the DCSPLANS prototype [...]
incorporate the existing [...] program which extracts
relevant Enlisted Master File (EMF) data, the [...] DBASE
II data dictionary which currently includes [...] and
office metadata in its metadatabase, and the DCSPLANS IBM
PC microcomputer. The prototype data filter system
therefore will be a PC-based, DBASE II program which
filters EMF data for two application models (two models
must be used to test the system's ability to differentiate
between the degrees of validation required by separate
models using the same input data source file).

The following steps suggest a methodology for
development of the initial DCSPLANS prototype data filter
system.

1. Determine and implement the [...] mechanism for
feeding [...] through the IBM PC data filter.

2. Expand the current data dictionary [...] by creating
additional metadatabase modules which accept and store
metadata about source file and model data elements.
Create an additional data dictionary metadatabase module
that will accept and store EVR.

3. Using the Phase II checklist [...], comprehensively
construct data definitions for source file and model data
elements, and create the EVR metadata which sets data
element validation parameters and interrelationships.
This step must be accomplished with the full, constant
cooperation of those DCSPLANS personnel most intimately
acquainted with [...] the two application models being
used for the prototype.

4. Load the data definition and EVR metadata into the
data dictionary metadatabase.

5. Using the functional modules from section B of this
chapter as a guide (particularly sub-function 4.1),
create an edit/validation program which will control
and implement the overall data filter process.

The development methodology presented above is based
upon a limited on-site review of DCSPLANS operations. A
more comprehensive examination of the DCSPLANS environment
(Phase I of the planning and preliminary design process
described in chapter four) will most likely uncover some
additional requirements and necessary adjustments.
Therefore a detailed on-site environmental review is an
essential prerequisite to any DCSPLANS data filter
development/implementation effort, especially one being
undertaken by non-DCSPLANS personnel.
VI. CONCLUSIONS AND RECOMMENDATIONS


A. CONCLUSIONS


DCSPLANS, MILPERCEN suffers from a [...] problem
common to many small user groups in large data processing
systems. It is unable to verify the correctness of input
data obtained from sources outside its span of control. At
the present time, DCSPLANS must rely almost exclusively on
the competence of its outside sources to guarantee the
integrity of its input data. The situation causes
DCSPLANS' managers a great deal of concern.

Top-level Army decision-makers use output from DCSPLANS
applications to formulate long-range personnel management
policies. Thus, the adverse impact of erroneous input data
entering DCSPLANS' models can be far-reaching and extremely
serious. Despite this fact, DCSPLANS' [...] in relation
to the overall MILPERCEN information [...] precludes it
from strongly influencing the [...] of a system-wide
validation capability. [...] develop and implement a
"local" solution [...] validation problem.

DCSPLANS' models and their associated source [...]
contain many of the same data [...]. A variety of
relationships exists among [...]. This situation demands
that DCSPLANS [...] techniques to insure the [...] of
data entering its models. In addition to routine format
checks, reasonableness checks are also required to insure
that input is both complete and consistent. Reasonableness
checks are more complex than the [...] checks. In fact,
they [...] an integrated validation process (i.e., data
elements, records and files are not only [...] on their
own, but also in relation to other relevant elements,
records and files). Of course, validation of the legality
and proper sequencing of the input activity itself must
precede the validity checks of the data.
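
The difference between a format check and a reasonableness check can be shown with a small Python sketch. The two fields and the minimum entry age are hypothetical and chosen only to illustrate a cross-element relationship; they are not drawn from the Enlisted Master File layout.

```python
MIN_ENTRY_AGE = 17  # assumed threshold, for illustration only


def format_ok(record):
    """Format check: each element, taken alone, must be all digits."""
    return all(record.get(key, "").isdigit()
               for key in ("age", "years_of_service"))


def reasonable(record):
    """Reasonableness check: elements must be consistent with each other."""
    age = int(record["age"])
    years_of_service = int(record["years_of_service"])
    return years_of_service <= age - MIN_ENTRY_AGE


def validate(record):
    """Apply the format check first, then the reasonableness check."""
    if not format_ok(record):
        return "format error"
    if not reasonable(record):
        return "reasonableness error"
    return "valid"
```

A record can pass every single-element format check and still fail here, which is why the reasonableness check has to look across elements.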

The [...] for DCSPLANS is the active data dictionary.
Configured as a data filter, the dictionary provides a
flexible, user-friendly, easily expandable [...] user
group. The data filter can be developed locally using the
expertise currently [...]. Such local development allows
[...] to be tailored precisely to DCSPLANS' own validation
needs. The data dictionary approach permits quick, easy
adaptation of the data filter to changes in [...] data
source files by simply adjusting [...] metadata; no
extensive validation program re-writes will be required.
Also, the use of a metadatabase as the sole source of data
for building EVR provides a [...] for keeping the EVR
consistent.

[...] data dictionary allows DCSPLANS to develop future
data processing tools/capabilities with relative ease and
minimal investments of time and money.

[...] is crucial to DCSPLANS' successful development of
a data filter. The overall DCSPLANS processing environment
must be understood, and data [...] associated validation
requirements must be comprehensively examined and
carefully documented. Thorough accomplishment of these
first two phases of development [...] for both preliminary
and detailed system design. Preliminary design should be
accomplished [...] functional decomposition of major
system functions. [...] functions must be derived from
phase one and two results, and must satisfy the
achievement of the specific goal and key objectives set
for the DCSPLANS system.


B. RECOMMENDATIONS


An effective DCSPLANS approach to [...] problem must key
on the concepts/designs presented in this thesis.

It is recommended that:

DCSPLANS pursue an efficient [...] solution which can
be tailored to its specific needs rather than await or
attempt to influence [...] an organization-wide
validation system. [...] the local solution applied [...]
data dictionary "data filter".


DCSPLANS begin development with a prototype system
that will validate Enlisted Master File (EMF) data for
use in two models. This approach tests the system's
ability to differentiate between the degrees of
validation required by different models using the same
source data file, and also takes advantage of the
existing CTRACS program (extracts relevant EMF data).
The prototype should be developed [...] easy-to-use
relational database management system with a simple
query language [...].

DCSPLANS appoint a small professional team to oversee
data filter development. The team must conduct a
thorough on-site review of DCSPLANS environmental
characteristics and [...] data definition and validation
criteria (Chapter Four) prior to [...] general design
(Chapter Five) [...]. While detailed design and [...]
off-site (perhaps as a thesis project), the review of
environmental characteristics, data definition, and
validation criteria must be accomplished on-site at
DCSPLANS by personnel familiar with DCSPLANS operations.
The checklists in chapter four provide comprehensive
guidelines for such an examination.